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Summary 


Duplex models can portray complex systems with the utmost of simplicity, clarity and 
efficacy. The drawcards range from the dearth of initial premises to the soundness of final 
conclusions. The mettle of the binomial approach shows up, for instance, in debunking the 
welter of myths and misconceptions that pervades the fields of finance and economics. 
According to the Efficient Market Hypothesis, the marketplace always reflects the totality of 
information available to the general public. Since every nub of know-what and know-how 
informs the latest prices, no single actor can improve on the valuation of assets ranging from 
stocks and bonds to commodities and realties. 

One consequence is the lack of trusty cues for forecasting the market: if every clue has 
been fully utilized, then any move henceforth has to come as a complete surprise. Another 
fallout lies in the Random Walk Model that pictures the path of the market as a form of 
Brownian motion whereby the price level is wont to shift in any direction with equal 
likelihood. 

Unfortunately, the Efficient credo abounds with flaws ranging from unreal assumptions 
and spurious concepts to inconsistent models and faulty conclusions. A counterpoint involves 
the wave motion of the stock market that belies the premise of utter randomness. As a 
recourse, a true science ought to build on hard data and staunch precepts, rigorous models and 
tenable results. To this end, the study at hand represents a small but fundamental step toward a 
coherent theory of the marketplace. 

To underscore the gulf between the mythos and reality, the work plan takes a minimalist 
approach. For starters, the inquest draws only on a minute fraction of the trove of information 
freely available at the most popular portal among the investing public. Moreover, the 
quantitative analysis relies solely on the simplest technique in statistical testing. From a 
computational stance, the attendant program invokes a skimpy subset of the built-in functions 
within the core module of the R system: the leading choice of programming language and 


software platform for data science in disparate domains. 


1. Introduction 


A duplex framework provides a lean and elegant way to model the dicey behavior of a 
complex system regardless of the domain. The crux of the methodology lies in the binomial 
test—the simplest tool in statistical analysis—for drawing conclusions with the greatest of 
ease, robustness and confidence. The mettle of the spartan approach shows up, for instance, in 
profiling the stock market: the epitome of chance and chaos in the world at large. 

According to the reigning school of finance and economics, the marketplace is so agile 
as to process the entirety of information available to the general public at once and employ the 
facts with perfect wisdom. This glossy image is known as the Efficient Market Hypothesis 
(EMH). Since the latest prices incorporate the totality of know-what and know-how, no one 
can improve on the valuation of assets ranging from stocks and bonds to commodities and 
realties. 

As a corollary, the EMH trumpets the absence of cues to predict the price level. If every 
factoid has already been absorbed, there is nothing left over for portending the market. Ergo, 
any move henceforth has to come as a complete surprise. Another consequent is the Random 
Walk Model that pictures the path of each asset as a form of Brownian motion whereby the 
price level moves up or down with equal likelihood. 

The attempts to describe the movement of assets based on the statistical physics of 
gaseous particles have spawned a deluge of literature that has scant bearing on the reality of 
the markets. Along with the false foundation of perfect efficiency, the entire edifice of 
financial economics stands on shaky assumptions and faulty arguments. The slip-ups begin 
with implicit and primary axioms such as ageless assets, utter rationality, and instantaneous 
action. The failings also appear in the explicit and secondary concepts such as utility 
maximization, perfect competition, and diminishing marginal productivity—all of which are 
empirically invalid and logically unsound (Keen, 2003). Hence the need for a fresh start based 
on solid data coupled with faithful models of the marketplace. 

In short, the regnant school of financial economics stands on a morass of fiction ranging 
from fanciful assumptions and feigned arguments to faithless models and forced conclusions. 


As a recourse, a wholesome approach ought to begin with a clean slate by building on hard 


data and realistic models. To this end, the current report represents a small but fundamental 
step toward a sound theory of the marketplace. 

To underscore the cleavage between the theory and reality, the work plan takes a 
minimalist approach while using the stock market as a case study. For starters, the inquest 
employs only a minute fraction of the treasury of information freely available to all comers at 
the most popular portal amongst the investing public. Moreover, the quantitative analysis 
relies solely on the simplest technique in hypothesis testing. From a computational stance, the 
requisite program invokes a sparing subset of the inbuilt functions within the kernel module 
of the R system: the top choice of programming language and digital platform for data science 


in the context of scientific assays as well as pragmatic pursuits. 


1.1. Objectives 


According to the central tenet of finance and economics, the marketplace is a model of 
perfection. In particular, the system is so efficient that it absorbs every whiff of information at 
once, thus leaving no further scent for forecasting the price level. As a corollary, every asset is 
deemed to move in a thoroughly random and unpredictable fashion. 

Given this backdrop, the current study overturns the entrenched doctrine with the 
utmost of simplicity, clarity and efficacy. For this purpose, the stock market represents the 
essence of flighty behavior that eludes the mass of practitioners and observers ranging from 
investors and analysts to academics and policymakers. 

A second goal is to demonstrate the power of an elementary tool in statistics for tackling 
an enigma that has long thwarted myriads of actors ranging from lone investors and fund 
managers to ivory-tower scholars and market regulators. The simplest technique in hypothesis 
testing, properly combined with other basal tools, can deliver decisive results at hefty levels 
of statistical import. Moreover, the binomial test used throughout this study is exact rather 
than approximate. That is, the criteria for testing the null hypothesis make use of firm and 
clear-cut numbers rather than iffy or fuzzy estimates. 

As a counterexample, consider a statistical tool that is far more popular than the 
binomial technique in far-flung domains ranging from scientific research to public policy. In 
deploying the t-test, the threshold for a critical region stems from a normal distribution. 
Unfortunately, the latter ogive is in general a mootable and uncertain template in modeling a 


complex system in the real world. 


By contrast, the binomial method makes no assumptions vis-a-vis the nature of the 
random process other than the independence of trials governed by fixed odds of bearing one 
or the other of a pair of disjoint outcomes. In the current study, the distinct turnouts consist of 
a gain or loss pegged by a suitable measure of performance. For this purpose, the basic form 
of the yardstick gauges the monthly return on investment to the upside or downside. 

A variation on the theme concerns the lead or lag in performance notched by one model 
against another in profiling the market. In the process, the binomial test does not require the 
outcomes to spring from a Gaussian process nor any other constraining function. For this and 
other reasons, the technique is straightforward, versatile and trustworthy. 

A third objective is to demonstrate the dearth of informatic skills needed to overturn the 
ruling gospel of financial economics. The case study highlights the functionality of the R 
system in building a simple but trenchant model of a convoluted system; namely, the stock 
market. Remarkably, a tiny subset of the core functions built into the software platform is 


enough to harness a problem of daunting complexity. 


1.2. Types of Disproofs 


From a tactical standpoint, the refutation of the Efficient dogma takes three forms. The first 
mechanism relies on proof by contraposition, as the testable properties of the EMH happen to 
contradict the actual behavior of the marketplace. The second type of rebuff involves proof by 
existence; in particular, a textured model of the bourse displays lithe patterns that differ 
markedly from the stiff and bland stencils embraced by the Efficient gospel. The third and 
related mode of rebuttal lies in proof by swpersedence whereby the refined model turns out to 
be demonstrably superior to the facile icons of the dogmatic creed. 

Any type of proof—whether by contraposition, existence or overclass—is by itself 
enough to annul the EMH. In that case, the three forms of confutation taken together pose a 
triple whammy of overwhelming evidence against the ruling credo of efficient markets. 

Furthermore, each class of proof may in turn bear a multiplicity of results. For instance, 
the tactic of supersedence reveals that the Sway Model supplants the Idle icon in terms of the 
imbalance of moves to the upside versus downside; and likewise for the average magnitude of 
the errors in profiling the monthly returns. The setup is similar for the double edge of the 


Sway framework in toppling the Drift makeshift. In these ways, the method of 


outperformance by itself delivers at least four distinct conclusions, each of which suffices to 
uproot the EMH and its offshoots. 

To sum up, the case study presents a nested system of multiform proofs anchored in 
multiplex dimensions. The meshwork of rebuffs represents a great deal of overkill since each 


of the elemental results is by itself enough to dispatch the myth of efficient markets. 


2. Background 


To set the proper context for the case study, this section surveys the key issues from a 
conceptual stance as well as a pragmatic slant. The topics on hand range from driving forces 
and knock-on effects in the marketplace to historical milestones and inborn flaws of the 


orthodox doctrine. 


2.1. Driving Forces 


The securities in the financial forum are issued by private concerns and public organs for a 
variety of reasons. An example involves a bond sold by a government agency as a way to 
raise funds for a special project, as in the case of building a dam or paving a highway. Another 
type of instrument concerns the equity of a company listed on a stock exchange in order to 
obtain cash for expansion or boost the firm’s visibility amongst the general public. 

In the financial arena, the price of each widget depends on the views of the participants 
regarding the security along with its issuer. As an example, a bond beckons the investor with a 
stream of periodic dividends followed in due course by the payback of the principal upon the 
scheduled date of maturity—assuming that the debtor remains solvent and has the capacity to 
discharge its obligations down the line. Meanwhile a share of stock conveys a slice of 
ownership in the current assets and future profits of the business as long as the firm survives 
the vicissitudes of the marketplace. 

The owners of bonds have priority over those of stocks. If a company were to declare 
bankruptcy, liquidate its assets, and close its doors, then the bondholders would have first 
rights to any of the surplus that remains. By contrast, the shareholders are entitled to the 
leftovers which may well amount to little or nothing at all. Due to the specter of hefty losses 


for the shareholders, stocks tend to be more volatile than bonds. In exchange for the greater 


risk in store, however, the shares offer the prospect of plump gains by way of capital 
appreciation. In these ways, the main draw of a stock is the potential for capital gains while 
that of a bond is the promise of cash dividends. 

The valuation of any asset depends greatly on the conditions in the marketplace. For 
instance, a spot of good news may lift the price of a stock while a stroke of bad luck might 
sink it. If a company is to remain viable, however, its gross revenues and net earnings must 
expand over time if only to keep pace with the bloat of inflation. Since a share of equity 
represents a claim on the intake of profits, the widget ought to appreciate in tandem. 

Along the way, however, the stock will also gyrate with the upthrows in the environs 
including the real economy and financial forum, political theater and cultural milieu. An 
example involves the total volume of sales or earnings in a sizable swatch of the marketplace. 
For instance, certain markets surge in the winter as in the likes of heating oil or ski equipment, 
children’s toys or tropical resorts. 

The story is similar on the financial front. As a backdrop, millions of workers in the real 
economy enjoy perks by way of cash bonuses as the year draws to a close. In the wake of the 
windfall, a portion of the bounty finds its way into the financial tract thus lifting the demand 
for virtual assets ranging from singular stocks to composite funds. 

In these and other ways, a plethora of cycles prevail in the tangible economy as well as 
the financial bazaar. As a result, there is plenty of reason to suppose that the assets in the 
marketplace should flow and ebb with the seasons of the year. On the other hand, we would 
not expect the cyclic pattern to be highly obvious nor strictly regular. As we shall see, any 
conspicuous motif will vanish in short order as a side effect of the wheeling and dealing by 
the hustlers who take advantage of the opportunities for easy profits. 

Despite the tricky factors, however, the meaningful question is not whether any patterns 
exist at all in the financial realm. Rather, the mindful query concerns the nature and extent of 


the motifs along with the degrees of transparency and tractability. 


2.2. Orthodox View of the Markets 


According to the conventional school of financial economics, the actors in the marketplace 
make full use of the information available to the general public. An example concerns the 
history of prices for a singular stock or the volume of transactions for the entire bourse. By 


this image of efficiency, the entirety of public information informs the current price of each 


asset (Fama, 1965). Since all the intel is exploited to the max, no one can augur the market 
beyond the latest price. 

From a historical stance, the origins of the EMH and Random Walk may be traced at 
least to the onset of the 20" century. The watershed was a mathematical model of Brownian 
motion along with its application to the valuation of stock options (Bachelier, 1900). 

By the autumn of the same century, the Efficient Market Hypothesis came to comprise a 
triplex of overlapping variants: weak, semi-strong, and strong forms. The strength of each 
version refers to the tightness of the constraints along with the degree of dubiety. 

According to the weak form, the prices of traded assets—ranging from stocks and bonds 
to commodities and real estate—teflect the totality of information from the past which is 
available to the general public. In that case, for instance, the historical record of stock prices is 
useless in foretelling the changes in price. 

Meanwhile, the semi-strong version of the EMH asserts not only that each asset reflects 
all the intel available to the public but that the price adjusts posthaste to incorporate brand- 
new motes of public information. Put another way, only an operator armed with confidential 
information may gain an advantage in the arena. 

Finally, the strong form of the doctrine asserts in addition that the price of each widget 
subsumes even occult facts hidden from the general public. A specimen lies in the latest tally 
of global revenues known only to the top executives of a conglomerate. 

From a pragmatic stance, the strong form of EMH is clearly unrealistic. In fact, the 
unfair advantage bestowed by confidential information has given rise to regulations against 
exploitive behavior such as insider trading by the directors of a corporation. 

By the weak form of the Hypothesis, current prices reflect all the public information 
available in the past and present. In that case, for instance, there is no way to outpace the 
market averages by sifting through historical data. 

This variant of the doctrine is weak in the sense that it embodies fewer assumptions than 
the other two versions, but is actually the strongest in terms of its overreach. Put another way, 
the frail version of the EMH is the most plausible of all—or more precisely, the least 
implausible. For this reason, the weakest in a formal sense is the strongest from an empiric 
stance. 

Meanwhile, the semi-strong form of the doctrine contends that fresh jots of news 
available to the public are incorporated promptly into the price level. In that case, no single 
actor can beat the market in a consistent fashion by reacting to the bulletins on the news 


wires. 


To be fair, we should note that the orthodox faith does offer a concession in terms of 
outpacing the benchmarks of the market. In particular, the expected return on investment is 
deemed to rise with the risk entailed. In the financial community, the principal measure of risk 
lies in the volatility of the price level. A common example is the standard deviation of 
monthly returns chalked up by the S&P Index. 

In competing against a market index, an investor may earn outsize profits by procuring 
a risky asset. According to the hallowed theory, however, a surfeit of risk is the only way to 
garner an excess return. By merging the notions of perfect rationality and risk dependence, the 
EMH asserts that it is impossible to outrace the market after adjusting for the volatility of the 
assets. In other words, the only way to overtake the market at large is to overflow the risk in 


store. 


2.3. Outgrowths of the Efficient Dogma 


The quintessence of the EMH and the Random canard lies in the stock market where millions 
of entrants vie with each other to earn superior returns. If the odds of rising and falling are 
equal, then the current price is the best guesstimate of the price level going forward. This 
shibboleth is a direct outgrowth of the Random Walk mockup from a pragmatic stance. We 
shall refer to this caricature of the market as the Jdle Model. 

At this juncture, we should take a step back and consider the big picture of the stock 
market over the long range. Even a casual student of the bourse can see at once that the price 
level has a way of climbing higher as the decades go by. In that case, a plausible model of the 
bourse ought to take account of the large-scale trend. 

For this reason, a variation on the theme of efficiency asserts that the upward tilt 
represents the best guess going forward. As an example, the optimal forecast of the price level 
a month hence is the current price plus a fitting increment of the gradual migration. We shall 
refer to the trending version of the EMH as the Drift Model. 

Since the autumn of the 20" century, a sea of literature has emerged in support of the 
EMH and its outspreads. Along the way, the mirage of utter efficiency has established itself as 
the premier model of financial economics. Unfortunately, the picture of perfect rationality 
contravenes the reality of the marketplace. 

A primary factor involves the challenge of survival in a harsh and unforgiving 


environment. Sadly, death is the way of life for all manner of firms in the real world. For this 
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reason, the best forecast of the price level for any stock over the long run is a breakdown 
followed by a demise. 

The hidden premise of everlasting assets lies behind another ubiquitous myth in the 
financial realm. In the received wisdom of amateurs as well as professionals, a market index 
tracks the fortunes of the membership over all time frames, from less than a day to more than 
a year. 

In truth, however, a mutable benchmark such as the S&P Index or the Dow Jones 
Industrial Average does not mirror the lot of the constituent firms nor their stocks over the 
long range or even the medium term. Rather, a yardstick of this kind merely reflects a relay 
race run by a succession of sprinters in the prime of their lives. To adapt the words of a well- 
known bard, each member of the ensemble is merely a shadow, a poor player that struts and 
frets its hour upon the stage and then is heard no more. 

In line with these examples, a plethora of fairy tales hold sway in the real and financial 
marts. On one hand, a balanced review of the myths and mistakes lies beyond the scope of 
this report. Even so, a systematic survey of the passive mistakes and active bungles in the 


field is available elsewhere (MintKit, 2020). 


2.4. Flaws of the Efficient Canon 


The cult of efficiency has been fingered as the source of numerous ills in the marketplace in 
the modern era. A showcase cropped up during the financial crisis of 2008 along with the 
Great Recession that ravaged the global economy. The cohort of critics against the Efficient 
mantra continues to swell. For instance, a former chairman of the U.S. central bank openly 
decried the dogma by declaring, “It should be clear that among the causes of the recent 
financial crisis was an unjustified faith in rational expectations, market efficiencies, and the 
techniques of modern finance.” (Volcker, 2011) 

In addition to flouting the facts from the get-go, the EMH suffers from a host of internal 
flaws in logic. For instance, the catechism begins with the concession that any investor in 
particular is imperfect and thus unable to make flawless decisions. Yet, the polemic leaps to 
the conclusion that a large number of humans can—and will—in the aggregate make faultless 
decisions at all times. 

Unfortunately, the binge of hand waving flounders from a conceptual stance as well as a 


pragmatic slant. From a logical standpoint, a finite number of imperfect beings cannot 
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produce perfect results without fail. To presume otherwise is to imagine that the imperfections 
will cancel each other out on every aspect of every market. In other words, the veiled premise 
behind the EMH is a flight of fancy that breaches the bounds of reason. 

Moreover, the ideology is at least as groundless from an empirical stance. In fact, the 
opposite of the dozy notion usually prevails in the real world. For instance, the mass of 
investors routinely violate the strictures of logic and even the dictates of common sense. A 
glaring example shows up in the frothy prices at the height of a bubble or the pulped assets in 
the depth of a panic. And in between, the market has a custom of lurching back and forth, 
from a state of being overpriced to undervalued and back again. 

Amid the endless cycles of blowup and meltdown, a tenacious streak shows up in the 
custom of the stock market for swerving in tune with the seasons of the year. We will christen 
this portrait of the market as the Sway Model. 

To sum up, the Efficient Market Hypothesis undergirds the entire corpus of financial 
economics. Yet, a modicum of data screened with dispassion is enough to refute the dogma of 
efficiency. In the imperfect world we inhabit, perfect rationality is a sheer figment of the 


imagination. 


3. Method 


As we have seen, the Efficient Market Hypothesis denies the reality of the real and financial 
markets in manifold ways. For this reason, the doctrine may be refuted through a variety of 
means. A basic approach centers on the discrepancy between the surmised nature versus the 
actual tenor of the markets. 

A case in point is the failure of the price action to follow the Gaussian distribution 
entailed by the Efficient mythos. In this light, a trickle of impartial studies since the autumn of 
the 20" century has shown that the movements of the market belie the straitjacket of a normal 
density function (Mandelbrot, 1963; Mantegna and Stanley, 1997; Vasiliki et al., 2000). 

On the downside, though, the landmark publications to date suffer from an excess of 
complexity. More precisely, the disproofs of the EMH and its spinouts rely on reams of jargon 
ranging from arcane symbols and fancy equations to abstruse procedures and baroque 
arguments. The convoluted nature of the exposés has doubtless played a major role in 


blunting the impact of the revelations. Not surprisingly, the Efficient dogma continues to 
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predominate in academic cloisters as well as pragmatic circles despite the abundance of 
evidence to the contrary in the world at large. 

The dearth of progress to date is a good reason to adopt a fresh approach to the subject. 
In this spirit, the case study at hand pursues a plain and intuitive track based on the seasonal 
patterns in the marketplace while drawing on a lean set of elementary tools from the fields of 
Statistics, programming and software. From a practical slant, the S&P 500 Index serves as the 
leading beacon of the stock market for professionals, be they practitioners in the trenches or 
researchers on the sidelines. The lodestar thus serves as the benchmark of choice for the 


current study. 


3.1. Features of the Dataset 


The actors in the marketplace confront a broad range of time frames. For instance, the short 
end of the spectrum applies to the day trader who buys and sells stocks within the span of a 
few seconds or a handful of hours. Meanwhile the long end of the range pertains to the aloof 
player who sets up an automated system to transfer funds from a bank account to a mutual 
fund for the purchase of shares every month, then ignores the antics of the market for years or 
decades on end. 

Despite the breadth of timespans, however, myriads of investors keep track of the 
bourse from one month to the next. For this reason, sampling the market on a monthly basis is 
a common practice in financial circles. An example involves a review of alternate strategies 
for investment over the past couple of decades based on the average payoff per month relative 
to the risk entailed. In keeping with the fond choice of practitioners as well as researchers, the 
current study takes up a sampling period of a single month. 

A second and related facet concerns the timing of observations within the monthly 
cycle. For instance, the readings could be made at the beginning of the period, the end of the 
spell, or some point in between. Once again, the financial community subscribes to a standard 
practice. The seasoned players in the ring regard the price level at the end of each period—be 
it an hour or a day, a month or a year, or any other window—to be the culmination of the 
activity throughout the entire interval. In other words, the closing price serves as a roundup of 
the price action in more ways than one. 

As we noted earlier, the S&P 500 Index (SPX) takes pride of place among the 


benchmarks favored by the professional community ranging from fund managers to academic 
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researchers. After evolving through a series of variants, the touchstone took its modern form 
in 1957: a rolling roster of 500 heavyweights in the stock market. The SPX was calculated 
back to 1926, even though the benchmark did not exist back then. 

From a different slant, the average level of the Index between 1941 and 1943 was set to 
a baseline of 10 units. That way, the value of the benchmark upon its launch in 1957 was 
comparable to the price of a representative stock which hovered around $45 to $50 at the time 
(Siegel, 1998, p. 60). 

By the same token, the SPX was reckoned to be 16.66 points at the onset of 1950. Given 
this background, a seemly stretch of the historical record in the modern era spans a window of 
70 years starting in 1950. 

To recap, a wonted practice in the professional community is to regard the S&P Index as 
a proxy for the stock market as a whole. A second habit is to monitor the returns on a monthly 
schedule. A third praxis is to treat the price level at the end of each period as a representative 
value for the entire interval. In keeping with these customs, the current study employs the 
closing prices of the S&P benchmark on a monthly basis. The data come from Yahoo Finance, 


the most popular site among the investing public (Yahoo, 2020). 


3.2. Proper Metrics 


In the financial forum as in the world at large, a routine practice is to reckon the value or 
status of an instrument by way of an absolute figure. An example involves a singular stock 
priced at $9.28 per share, or a compound index pegged at 387.45 points. From the standpoint 
of an investor, however, the absolute level by itself has little or no meaning. Instead, the raw 
price is a means rather than an end in the decisions they make and the actions they take. 

To bring up an example, consider a stock named Alpha which trades at $14.83 per share. 
Is the latter price high or low, good or bad? The quandary is similar for a market index named 
Omega which rose by 10 points over the past year. Was the upturn a slugger or a slacker, a 
windfall or a letdown? For neither instrument can anyone offer a cogent answer based solely 
on the information at hand. 

To determine whether Alpha is overpriced or not, for instance, we need to obtain more 
facts. An example of the latter concerns the total value of the company along with the number 
of shares outstanding, or the price level today compared to the unit cost a year ago. Ina 


similar way, an uprise of 10 points for Omega happens to be a respectable showing if its value 
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at the outset was 50 points—in which case the payoff amounted to 20% in a single year. But 
the same turnup was paltry if the starting figure was 500 points thus yielding a capital gain of 
just 2%. 

In line with these vignettes, the ultimate yardstick of performance for an investor is the 
relative payoff over time. An example involves the fractional change in price notched by a 
singular stock in a given year, or the relative gain etched by a compound index from one 


month to the next. 


3.3. Treatment of Null Returns 


The financial forum finds itself in a continual state of flux as every asset swells and swoons 
with the passage of time. Despite the gyrations along the way, however, the price level may 
settle at the same point from one period the next. There are several ways to handle a null 
return in building a bare-bones model that admits only a pair of dichotomous outcomes, 
whether to the upside or downside. 

One approach is to ignore the datum entirely. In that case, a value of zero is treated as an 
aberration and excluded from further consideration. That is, the oddity is branded as “Not 
Applicable” and ignored in calculating the mean, median, or any other digest of the sample. 

A second recourse is to assign a value of nil to each of the opposing groups. For 
instance, a reading of zero could be reclassified as half an instance of a positive return and 
half a case of a negative gain. 

A third method makes use of the domain knowledge pertaining to the case study. As a 
backdrop, we note that the purpose of venturing into the bourse is to earn a profit by way of a 
capital gain and/or dividend yield. On the other hand, many a stock pays little or nothing by 
way of cash payouts. For this reason, the prospect of capital gains is in general the main 
reason for trafficking in stocks. 

To bring up a counterpoint, an investor whose primary goal is the preservation of wealth 
in nominal terms could stash their money in a savings account at a commercial bank. In that 
case, the saver would enjoy a steady stream of interest payments as well as get their principal 
back in due course—all with much less heartburn and far greater assurance than holding a 
stake in the stock market. 

An alternate ploy is to buy a bond and hold it till the date of maturity. By this means, a 


bondholder in the credit market can be much more confident of retrieving the principal 
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committed at the outset in comparison to a shareowner in the equity market. Moreover, the 
dividend yield on a bond is apt to exceed the interest rate on a savings account at a bank. 

Despite these facts of life, however, the chestnut of efficiency brushes aside a vital 
aspect of the bourse by averring that the price level is wont to rise or fall with equal odds and 
to like extent. If the expected return were in fact zero, why would any prudent soul—let alone 
a throng of godlike beings endowed with perfect wisdom—accept the threat of a wipeout in 
the stock market when they could enjoy a steady stream of earnings from a savings account or 
a government bond, as well as regain the principal in due course, without incurring any risk to 
speak of ? 

The blindness to domain knowledge is a profound way in which the EMH lacks verity 
and lucidity, not to mention relevance and credence. To believe in the pretense of null returns 
on average is to deny the very purpose of investing in the stock market in the first place. 
Moreover, the mummery of a Random Walk gainsays the reality of the bourse which in fact 
has a way of trudging higher over the long range. 

To return to the main theme of this section, we note that an investor who encounters a 
stagnant market has failed to fulfill their aim of earning a capital gain. In other words, a null 
change in price is a disappointment for the risk-taker. From a pragmatic stance, then, a flat 
turnout is tantamount to a failure, much like a negative return. For this reason, only a positive 
return counts as a success for the serious player. 

In brief, the stock market has a way of bouncing around over all time scales, from less 
than a day to more than a year. In spite of—and due to—the tireless rambling, a widespread 
practice in the financial community is to track the movements on a monthly basis. 

To bring up another custom, the S&P Index is usually reported by the financial media to 
the accuracy of two decimal places. In such a tally, the last decimal represents a small fraction 
of the spread of prices that usually crops up during a single day. In that case, the proportion is 
of course even smaller in relation to the range of prices observed over an entire month. For 
our dataset, the values have been recorded to the sharpness of six decimal places; as a result, 
the chance of a repeat performance from one month to the next is paltry indeed. 

As the market turns and churns, it displays a slim chance of a null change from one 
period to the next. In our sample, there is only a single case of repeat performance. The S&P 
benchmark notched up 109.320000 points at the end of August 1979; after bouncing around 
for a while, it closed at the same level the following month. 

To round up, a sober investor ventures into the stock market and bears the risk of loss in 


the hope of earning a profit. In that case, a null return is a letdown for the risk-taker. For this 
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reason, we will regard a turnout of zero percent as a failure rather than an ignorable case, and 
certainly not as a success. 

From a pragmatic stance, however, any reasonable approach to handling a null return is 
apt to be admissible. Given the rareness of a flat spell in the market, the special case is 
unlikely to have a material impact on the overall results regardless of the particular mode of 


treatment. 


3.4. Choice of Statistical Test 


As we noted earlier, the binomial test imposes no constraints on the nature of the process 
under investigation other than the fixed odds of observing a pair of disjoint outcomes. For 
starters, each trial or experiment gives rise to one of two outcomes. Secondly, the trials are 
mutually independent. Thirdly, the probability of each outcome is the same for the entire 
sample. In other words, the only source of variation is the innate randomness of a binary 
process where the odds of the polar events remain constant. 

According to the Random Walk bromide, the price level on the next step can only move 
upward or downward. Secondly, the chance of a rise or fall at each stage does not depend on 
the current price nor any other aspect of the market; in particular, the prospect of an upstroke 
on each occasion is unaffected by the turnouts of other periods. Thirdly, the probability of an 
upcast remains fixed across all episodes. 

These three conditions are fully aligned with the assumptions behind the binomial test; 
namely, a set of independent trials each of which abides by fixed odds of assuming one of two 
disjoint outcomes. As a result, the binary probe is eminently suited for vetting the EMH and 
its byproducts. 

If we look at the big picture, the binomial test is conservative in the following sense: the 
technique is less likely than other methods to detect a departure from the null hypothesis even 
when such an aberrance does in fact exist. Ironically, the reason has to do with the liberal 
nature of the binal checkup. 

To begin with, we note that a system marked by a single fixed outcome at all times 
amounts to a certain process rather than a chancy one. For this reason, a random process of 
any kind has to display at least two distinct states. 

The other assumptions behind the binomial test are also sparing and lenient; namely, the 


independence of trials coupled with static odds of observing one outcome versus its 
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complement. In this way, the binal framework adopts an expansive view that admits a 
plethora of flighty systems. 

If nearly everything is unremarkable, then almost nothing is exceptional. Due to the 
broad embrace, only an extreme oddity may fall outside the scope of admissible schemes. For 
this reason, the binomial test is often unable to detect anomalies even when they can be 
spotted by other techniques. Yet this very shortcoming represents a keen virtue for the case 
study at hand. If a binate assay—in spite of its accommodative nature—happens to reject the 
null hypothesis, then we may accept the unusual verdict with a high degree of surety. 

In the current context, the random process involves the movements of the stock market 
from one month to the next. If the return in any given period is positive, the outcome is 
treated as a success; and otherwise as a failure. A duplex model of this kind readily serves as a 
testbed for the EMH along with its spinouts in the form of the Idle and Drift Models. 

The setup is similar for the Sway Model which presumes that the odds of overshooting 
or underrunning the average return in a particular month remain fixed from one year to the 
next. Moreover, the outcome depends only on the pertinent month and is independent of the 
other portions of the year. 

To sum up, the binomial test comes with a bevy of attractions. For starters, the 
technique is an exact method rather than an approximate one. That is, the criteria for vetting 
the null hypothesis make use of precise values rather than rough estimates. Moreover, the 
binal framework is so elemental that it imposes scarcely any constraints on the underlying 
process. As an example, no parameter has to be presumed in advance nor inferred from the 
data other than the very statistic being tested; namely, the proportion of successes, or 
equivalently the ratio of odds for a couple of opposing outcomes. 

The upshot is a nonparametric tool that can handle any reasonable form of distribution 
function lying behind the binate process. For instance, the binomial test does not require the 
observations to spring from a Gaussian distribution nor any other prefixed function. For a 


variety of reasons, then, the procedure is highly robust as well as trustworthy. 


3.5. Choice of Software Environment 


The R language is a handy medium for crunching data and gleaning statistics. For example, 
the programming lingo is backed by an open-source platform that offers a wealth of 


functionality. The software kernel—also known as Base R—may be enhanced by installing 
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one or more extensions, each of which is called a package. For this purpose, an online 
resource called the Comprehensive R Archive Network stockpiles a myriad of plug-ins; at last 
count, the number of boosters surpassed 16,000 (CRAN, 2020). The add-on modules range 
from routines for conducting advanced statistical tests to utilities for streamlining the analysis 
of time series data. 

In keeping with the minimalist approach, however, the current study refrains from using 
such turbocharged tools and relies only on a scanty subset of the basal functions built into the 
core platform. The entirety of data formatting, graphic output, and statistical grilling is 
effected through a pithy program comprising several dozen lines of code in the R language, 
along with a modicum of explanatory remarks which are provided solely for the benefit of the 


human reader rather than the steerage of the computer system. 


4. Profiling the Data 


In tracking a market index, we could focus on one or more aspects of the price action. An 
example lies in the price level at the beginning of each month; or perhaps the mean value of 
the high and low prices for each period. In these and other ways, a number of choices are 
available. Even so, a standard practice in the financial community is to sample the price at the 
end of each interval. We will abide by this custom and monitor the closing values on a 
monthly basis. 

As a warm-up task, we plot the data to obtain a global view of the price activity. The 
resulting graphic in the form of Fig. | makes clear that the price level expands over time 
despite a series of sizable upswings and downstrokes along the way. 

On the whole, the Index appears to grow at an exponential rate over the entire timeline. 
That would be a reasonable turnout since the benchmark comprises a weighted average of the 
stock prices for a rolling roster of the biggest and stoutest firms on the bourse as they come to 
the fore then duly fade away in turn. Moreover, each share of stock represents a slice of 
ownership in the current assets and prospective profits of the underlying company. At a 
minimum, then, the equities ought to keep pace with the march of inflation that tends to 


advance by a couple of percent or more per year. 
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Fig. 1. Monthly level of S&P Index from January 1950 to January 2020. 


Our next task is to profile the large-scale trend by plotting the logarithm of the original 
values. In Fig. 2, the vertical axis depicts the log to base 10 of the closing prices. This chart 
suggests that the overall trend is more or less steady. We thus infer that the original variable 
displays an exponential streak. Since the SPX has a way of swelling over time, the next step is 
to extract the conspicuous trend before gleaning any subtler patterns. 

In line with earlier remarks, the price level is a means rather than an end for the mindful 
investor. In gauging the relative payoff over time, we focus on the fractional changes in price 
on a monthly basis. As a bonus, sizing up the return on investment in a stepwise fashion has 
the helpful effect of incorporating the long-range trend. 

Following the qualitative scan of the dataset, we proceed to a quantitative probe. As a 
lead-in, we reckon the fractional changes in price from one month to the next over the entire 
timeline. At each stage, we calculate the relative gain—also called the forward return—over 
the next period as a percentage of the current price. The turnout in the form of Fig. 3 shows 
that the returns vary a great deal, especially after the first hundred cases or so. Despite the 
dispersion, though, there appears to be no material change in the scattering pattern across the 


entire stretch. 
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Fig. 2. Log to base 10 of SPX by month over the entire timespan. 
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Fig. 3. Forward monthly return for SPX from 1950 to 2019. 
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The forward return for December 2019 takes account of the price levels at the end of 
that month as well as the close of the following month; namely, January 2020. For this reason, 
it would be incorrect to regard the original dataset as a muster of readings spanning a window 
of precisely 70 years. Even so, it would be correct to say that the working dataset covers 


exactly 70 years’ worth of monthly changes in price starting with the onset of 1950. 


5. Crafting the Models 


This section presents a trio of well-defined models. Two of the templates represent the 
Efficient doctrine while the third depicts the seasonal cycle of monthly returns throughout the 
year. To this end, our first task is to convert the airy visions of the Efficient mantra into a 
couple of concrete models. 

We begin with the Idle icon which presumes that the market rises or falls with equal 
likelihood and to similar extent in either direction. Since the expected moves to the upside and 
downside are perfectly balanced, the best forecast of monthly returns at any stage is fixed at 
zero percent. 

On the other hand, the setup differs somewhat for the Drift motif. The latter mockup 
does rely in a small way on the actual mien of the market; namely, the uptrend over the long 
range. For the dataset at hand, the geometric mean of the monthly gains amounted to 
0.6260826 percent. 

Our next task is to divine the seasonal patterns in the stock market. For this purpose, we 
discern the average return by month throughout the year then display the result as Fig. 4. The 
graphic shows that, more often than not, the mean return had a positive bent. In particular, the 
biggest gains occurred in March, October and November. Moreover, the market hardly moved 
on average during January, April, May and July. On the downside, the worst performance 
cropped up in August. 

For a richer view of the dataset, we generate a boxplot of the forward returns for each 
month and display the result as Fig. 5. We should keep in mind that the thick bar within each 


rectangle represents the median level rather than the mean value for the pertinent month. 
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Fig. 4. Mean return for each month of the year. 
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Fig. 5. Boxplots of forward monthly returns in percent. 
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The thumbnail for each period reflects a great deal of dispersion compared to the 
variation in median values throughout the year. Most of the boxplots in Fig. 5 depict one or 
more outliers as shown by the circles lying beyond the whiskers. The only exceptions are the 
sixth and twelfth months—namely, June and December—which lack any extreme values. 

As we noted earlier, the dispersion of entries within each month is sizable compared to 
the disparity of median values across the groups. Despite the broad scatter of readings along 
with the attendant clutter, however, some differences in the average values are apparent. 

For instance, the median return for October appears to surpass those of September and 
November. Meanwhile, an example on the flip side applies to August whose average gain falls 
below those of July and September. From a larger stance, the relative placement of the 
median values is on the whole similar to the configuration of mean returns that we saw in 
Fig. 4. 

At this juncture, we wish to determine whether the mean or the median statistic provides 
a better fit between the model and the data. We will refer to the vector of mean returns by 
month as the zig array. By the same token, the zag vector denotes the lineup of median 
returns. The mean value of the absolute errors for the zig vector came out to 3.081209 
percent while that for the zag was 3.067265. Meanwhile, the median value of the absolute 
errors for the former vector amounted to 2.440439 percent while that for the latter was 
2.472576. 

Based on these results, the mean level of absolute misfits for the zig array was slightly 
larger than that for zag. On the other hand, the median value of the absolute residuals for the 
zig vector fell slightly below that for zag. 

In short, the zag vector outperformed the zig array by a small amount in terms of the 
mean level of absolute errors while the converse was true by way of the median values. Based 
on the mixed results, there was scant reason to choose one metric over the other. As a rule, 
however, we know that the median statistic is more robust than the mean yardstick. For this 
reason, the zag vector rather than its zig rival was adopted as the backbone of the seasonal 


model. 
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5.1. Central Role of Errors 


According to the EMH, the current valuation of each asset incorporates every hint of future 
events. Ergo, the price level cannot be foretold except perhaps by happenchance. One spinout 
of the EMH is the equality of odds for the market to rise or fall from one period to the next. 
Granted, the very nature of a dicey system means that the number of upstrokes is unlikely to 
precisely match that of downthrows during any given interval. Even so, the disparity should 
not be so pronounced as to raze the premise of equality at an ample level of statistical 
significance. 

The verity of a model may be judged by the randomness of the errors that ensue. As a 
counterpoint, a palpable pattern within the residuals implies that the model has failed to 
capture one or more salient aspects of the target system. With this truism in mind, we turn to 
an informal survey of the residuals through a graphic display as a prelude to a quantitative 
probe. 

As we noted earlier, the Idle Model presumes that the expected value of the price level 
going forward is equal to the current state. In that case, the implied forecast at any stage is a 
null change in price. As a byproduct, the actual return over the subsequent month reflects the 
mantic error for that stage. 

At this point, we should clarify an item of bookkeeping. In everyday language, the error 
of an estimate refers to the difference between a foretold value and the actual amount. To 
bring up an example, consider a bag of candy which by your reckoning contains 30 pieces. 
After a closer inspection, though, you find that the bag holds 27 nuggets. In that case, your 
overestimate was 30 minus 27, which equals 3. Put another way, your error came out to 
positive 3 units. 

On the other hand, a friend of yours may have opined that the same bag contains 23 
pieces. In that case, the underestimate amounted to 23 reduced by 27 which comes out to 
minus 4. That is, the slip-up was negative 4 units. 


In either case above, the miscue amounted to the following: 


error = estimate — actual 


By contrast, the custom in statistics is to define the discrepancy in an indirect way by using 


the following relationship: 
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actual = estimate + error 


In other words, the error is the amount needed to reconcile the estimated value with the actual 


observation. We may rewrite the foregoing equation as follows: 


error = actual — estimate 


In comparing the two expressions in bold for the error term, we find that the second 
formula is precisely the negative of the first. In this way, the statistician takes a contrary view 
of the lapses in comparison to a normal person. Despite the aberrant behavior, we will abide 
by the formal custom and define the error as the difference between the actual value and the 
prior estimate. 

To bring an example, the Idle icon presumes that the best forecast of the forward return 
is a big fat zero. Since the estimate is always zero, the miscue due to the Idle motif at each 


step 1s: 


error = actual — estimate = actual 


We will use the latter result for the sake of propriety. In practice, though, the choice of sign 
will make scant difference since we will work mostly with the magnitudes of the residuals. 

To move on, we note that the Drift mockup pictures the market in the same way as the 
Idle version except for a steady uptrend in price. Based on the convention described above, 
the ensuing error at each stage is the difference between the actual return and the geometric 
mean that we deduced earlier. 

On the other hand, the Sway Model captures the flutter of the market throughout the 
year. For this reason, the implied forecast at any stage is the average change in value for the 
corresponding month. Moreover, the gap between the actual return and the distilled value 
represents the error for that period. 

The results thus obtained may be displayed compactly by way of a boxplot for each of 
the error vectors. Fig. 6 shows that the average values of the residuals scarcely varied in 
comparison to the full scatter of misfits. 

A cursory scan of such an image could easily lead a casual observer to surmise that 


there is no meaningful difference among the error profiles. Despite the merits of a graphic 
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display, however, a visual survey of the data can be misleading; hence the need for a rigorous 
assay by way of a quantitative workout. As a starting point, we compare the dispersion of 


errors due to each model. 
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Fig. 6. Boxplots of errors by model. 


5.2. Scatter of Errors 


In appraising the Efficient doctrine, it often makes no difference whether the Idle Model or 
Drift version is chosen as the archetype. The only distinction between the two templates is a 
fixed level of monthly returns. More precisely, the implicit forecast due to the Idle icon at 
each stage is precisely zero while that for the Drift motif is a fixed rate which happens to be 
slightly positive. Either way, the pattern of miscues around their central values will be 
identical for the duo of Efficient models. 

In gauging the fitness of a model, an obvious tack is to compute the variance of the 
residuals. The resulting tally for the Idle Model amounted to 16.85874. Since the variance of a 
sample is unaffected by a linear translation of the original data, the turnout for the Drift 


makeshift had to match the Idle version. 
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On the other hand, the corresponding value for the Sway motif came out to 16.49832. In 
this way, the variance for the Sway template was slightly less than that for the Idle and Drift 
mockups. However, the difference was modest since the small edge in accuracy claimed by 
the seasonal framework was largely masked by the moil and noise of the stock market. 

Another straightforward way to vet a model is to compute the average size of the 
residuals. In the presence of positive and negative values, however, the errors will cancel each 
other out to a greater or lesser degree. As a remedy, we may inspect the absolute values of the 
miscues. The mean absolute error (MAE) for the Idle Model amounted to 3.210519. The 
corresponding values for the Drift and Sway templates were 3.124301 and 3.067265 
respectively. 

Based on these results, the mean amplitude of the residuals due to the Idle Model 
exceeded that of the Drift motif. Meanwhile the latter canard fared worse than the Sway 
template. In short, the seasonal model outranked the Drift makeshift which in turn outflanked 
the Idle mockup. 

A variation on the theme of dissonance concerns the median rather than the mean size of 
the errors. In this light, the median absolute deviation (MAD) denotes the median value of the 
absolute offsets from the median level of returns. The MAD values for the Idle and Drift 
Models were equal and amounted to 2.579433. As we noted earlier, the Efficient models 
displayed the same configuration of errors aside from a uniform offset stemming from the 
global uptrend of the stock market. Given the fixed correspondence between the error values, 
the pattern of misfits around their median levels was identical as well. For this reason, the pair 
of Efficient templates yielded the same value of MAD. 

On the other hand, the corresponding tally for the Sway motif was 2.472576. In this 


way, the seasonal template bettered the pair of Efficient mockups by a modest amount. 


5.3. Comparing the Measures of Scatter 


When a set of observations springs from a normal distribution, the median absolute deviation 
is apt to be smaller than the standard deviation. More precisely, the expected ratio is 1/1.4826 
which amounts to some 0.67449 (Leys, 2013; Ruppert, 2010). In this way, the MAD statistic 
tends to be slightly larger than two-thirds of the SD metric. 

By comparison, the mean absolute error is apt to exceed the MAD. More precisely, the 


expected value of the MAE is around 0.79788 times the SD when the sample comes from a 
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normal distribution (Geary, 1935). Roughly speaking, then, the MAE is wont to be some four- 
fifths of the standard deviation if the density function happens to be Gaussian. 

The relative sizes of these statistics are of course consistent with their intrinsic nature. 
We recall that the MAD yardstick makes use of the median level of the input data. Moreover, 
the median is a sturdy measure of centrality whose value is unaffected by a moderate number 
of outliers to the high side and/or low side. This staunch property applies to the median of the 
original dataset; and likewise for the ensuing residuals around their common median point. 

From a different angle, the MAD yardstick gauges the precision of a model by the 
median size of the offsets around the median point of the errors. In a parallel way, the SD of a 
set of errors reflects the precision of a model rather than its accuracy. 

By contrast, the MAE is a measure of accuracy in profiling the target system without 
making any adjustment for the errors around their own central point. As it happens, an 
accurate model is likely to be precise as well. On the other hand, a precise model merely has 
to yield consistent results whether or not the output happens to be accurate. 

To sum up, the mean absolute error conveys the average size of the discrepancy 
between the actual returns and the presaged values. That is, the MAE gauges the correctness 
of the model in depicting the reality. By contrast, the MAD measures the compactness of the 
errors around their own central value; the setup is similar for the SD. 

We note that the standard deviation reacts more to outliers than the mean absolute error. 
In computing the variance of a dataset as a preliminary step, the residuals are first squared 
then summed up, after which the ensuing tally is divided by the degrees of freedom. The same 
type of warpage applies to the standard deviation since the latter is merely the square root of 
the variance. As a result, the SD is highly sensitive to outsize residuals. 

The use of the SD befits an application where a big gaffe conveys a lot more punch than 
a small lapse. An example involves the occasional crash of the stock market despite the usual 
stir of modest moves to the upside and downside. From the standpoint of a sober investor, the 
dips of moderate size are customary and tolerable. On the other hand, big falls pose jarring 
shocks that ought to be avoided or at least minimized. From this perspective, the standard 
deviation of the residuals would be apter than the MAE or the MAD over the short term. 

Over the long haul, however, the average return throughout the entire stretch rather than 
the transient tremors along the way is the main determinant of portfolio performance and 
investment success. For this reason, we will continue to abide by the principle of robustness 


and thus favor the MAD over the MAE in the absence of compelling reasons to do otherwise. 
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In brief, the standard deviation is prone to exceed the mean absolute error which in turn 
tends to outweigh the median absolute deviation. The relative sizes of the three types of 
metrics may be presaged to some extent in the case of a normal distribution. On the other 
hand, many a process in the real world deviates from the Gaussian ogive. A good example lies 
in the stock market, where the price action resembles a normal distribution at first sight but 
not upon closer inspection. 

The next step involves a matchup of the MAE against the standard deviation for the trio 
of candidate models. The setup is similar for the MAD in relation to the SD. 

We begin with the pertinent ratios for the Idle template. For the dataset in hand, the 
quotient of MAE to SD for the Idle idol came out to 0.7819207. Meanwhile, the ratio of MAD 
to SD for the same template amounted to 0.6282200. These results did not vary greatly from 
the ratios to be expected in the case of a normal distribution. 

On the other hand, the absolute errors for the Drift template were smaller than those of 
the Idle version. In particular, the quotient of MAE to SD for the Drift motif was 0.7609225. 
The latter value fell below the corresponding tally of 0.7819207 as noted above for the Idle 
version. 

By contrast to the MAE, the MAD metric focuses on the dispersion of errors around a 
suitable measure of centrality. The same is true of the SD metric in its own way. From a 
geometric slant, the MAD and SD yardsticks are unaffected by a plain translation of the error 
vector. For this reason, the resulting tallies for the Idle and Drift Models were identical. To be 
precise, the ratio of MAD to SD amounted to 0.6282200 for both the Idle and Drift mockups. 

To recap, the error vectors for the Idle and Drift templates displayed the same 
configuration aside from a fixed offset. For this reason, the dispersion of misfits around their 
central values was identical by way of the MAD yardstick; and likewise for the SD metric. As 
a result, the ratio of MAD to SD for the Drift motif matched that of the Idle bromide. 

Finally, we come to the Sway Model. The ratio of MAE to SD for the cyclic template 
was 0.7551470. The quotient was somewhat less than the corresponding values we found 
earlier for the Drift and Idle canards. Meanwhile, the ratio of MAD to SD for the seasonal 
framework amounted to 0.6087370. The latter figure fell below the corresponding ratio for the 
Idle and Drift templates. From a different angle, the last two quotients diverged moderately 
from the values to be expected in the case of a Gaussian distribution. 

In sum, the value of MAE under a Gaussian ogive tends to equal roughly four-fifths that 
of SD while that of MAD lies around two-thirds. For our study, the corresponding ratios were 


slightly lower. The findings reflect the fact that the patterns in the stock market bear only a 
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passing resemblance to the normal distribution. The latter ogive conveys a maximum of 
entropy coupled with a minimum of information, thus evincing the ultimate in randomness. 
The results obtained above provide further proof that the path of the market resembles a 
Gaussian process in a superficial sense but not in a profound way. 

In keeping with the rift between the real and the ideal, the Drift model outclassed the 
Idle motif by a small margin when the mean absolute error served as the yardstick of 
performance. On the other hand, the median absolute deviation is merely a measure of 
precision rather than accuracy; since the error vectors due to the Efficient mockups differed 
only by a fixed offset, the residuals displayed the same value of MAD. Better yet, however, 
the Sway Model outranked both of the orthodox icons by way of the MAE as well as the 
MAD metrics. 


6. Overall Direction of Movement 


As we have seen, the binomial test makes only a few mild assumptions regarding the random 
process. Firstly, each trial gives rise to precisely one of two potential outcomes. Secondly, the 
probability mass function behind the process stays fixed. Thirdly, each trial is independent; 
that is, the turnout of any experiment has no impact on the others. We may combine these 
three precepts into a single statement as follows: every trial is independent of the others while 
they all display the same odds of one outcome versus its obverse. 

In addition to its sheer simplicity, the binomial test flaunts another fetching feature in 
hypothesis testing. The rare trait concerns the rigor of exact values rather than approximate 
ones in defining the critical region for rejection. 

As a counterpoint, consider a different tool that is widely employed in academic studies 
as well as practical assays; namely, Student’s t-test for checking whether or not a difference in 
the mean values of a couple of samples happens to be significant. The t-test relies on several 
assumptions, the most irksome of which are the following. First, the random process behind 
each sample should abide by a normal density function. Second, the variances for the two 
groups ought to be identical. Either or both of the foregoing constraints are often breached in 
complex domains including the real and financial markets. As a result, the t-test in spite of its 


popularity is much frailer than the binomial probe. 
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A variation on the theme lies in Welch’s t-test which presumes that the variances for the 
two groups may differ. Even here, though, the premise of a normal distribution for each 
cluster still applies. 

Moreover, the variance remains unknown unless the entire population of each group has 
been canvassed. If the datasets happen to be complete, then the difference of means may be 
checked directly without resorting to any form of dicey checkup such as a t-test. However, the 
variance is in general a crucial parameter that is merely a guesstimate based on a partial 
sample. 

In using a t-test to weigh the difference of means between two groups, we would have 
to perform the procedure even if we happen to know the other key properties of the 
underlying system; namely, the conformance of the samples to normal ogives along with the 
exact values of the variances. By contrast, there would be no need to conduct a binomial test 
at all if we knew the ratio of odds in advance. 

Yet the same type of bypass does not apply to the bulk of statistical tests. In practice, the 
underlying distribution is seldom known with certainty; and likewise for one or more 
parameters which play pivotal roles in the statistical assay. 

To recap, the binomial recipe requires no guesswork or presumption regarding the value 
of any parameter. From an empirical stance, the only unknown factor is the true ratio of odds 
which is the very thing being tested under the null hypothesis with the help of solid data. For 
this and other reasons, the nonparametric probe is applicable to any plausible form of 


probability mass function lying behind the binate process. 


6.1. Precise Peg of Ups and Downs 


The exact binomial test is de rigueur when the sample size is small, as in the case of n < 30 or 
so. From a larger stance, the technique due to its robustness represents a staunch way to check 
for randomness even for a sizable dataset such as the current sample. Against this backdrop, 
we begin by plumbing the pattern of ups and downs in price. 

According to the Efficient mantra along with the Random Walk takeoff, the moves in 
either direction arise with equal likelihood. In that case, the null hypothesis posits the chance 
of upturns to equal 0.5. In the absence of further information or supposition, the alternative 
premise entails a two-tailed test wherein the critical regions for rejection lie on the low side as 


well as the high end. 
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In the current study, the number of successes—that is, the turnups in price—came out to 
506 out of 840 trials. The ratio amounted to some 0.602 thus exceeding the anticipated value 
of one-half. 

From a different slant, the 95 percent confidence interval for the actual (unknown) 
fraction lay between 0.5683776 and 0.6356633: a range that excluded the putative value of 
0.5 pertaining to the null hypothesis. More to the point, the assay rendered a p-value of 
3.216x10-®. As a result, we could rebuff with a great deal of certitude the premise that the 
movements of the market were thoroughly random. That is, the Idle Model contravened the 
data at a high level of statistical significance. 

In the case of the Drift motif, the best forecast at each stage reflects the fixed uptrend 
over the long range. Other than that, the sloping template resembles the idling motif. In 
particular, the odds of leading or lagging the trendline are deemed to be equal. For this reason, 
the null hypothesis posits the fraction of positive residuals to be one-half. 

In actuality, the number of upturns equaled 450 out of 840 cases thus yielding a quotient 
of some 0.536. The latter proportion strayed by a notable amount from the expected value of 
0.5. Moreover, the p-value of 0.04172 fell below the usual level of 0.05 that serves as the 
threshold for ejecting the null hypothesis. 

In this way, the market displayed a small propensity to rise rather than fall in relation to 
the prolonged uptrend. We saw earlier that the Idle canard was squarely incompatible with the 
data. To a lesser degree, the Drift motif also flouted the facts according to the binomial test of 
directional moves. 

We now proceed to the Sway Model. The number of successes, or miscues to the 
upside, equaled 420 out of 840 cases. Remarkably, the resulting quotient precisely matched 
the expected fraction of 0.5. Moreover, the p-value of 1.0 proved the null hypothesis to be 
highly compelling. Put another way, the Sway Model deftly pinned down the midpoint of the 
movements in the following sense: the implicit forecast was as likely to overrun as to 
undershoot the actual return going forward. 

We have just assessed the pricing models in terms of the fraction of correct calls. In 
particular, a proof by contraposition affirmed that the Idle bromide runs afoul of the reality. 
Moreover, the Drift canard was also implausible to some degree. By contrast, the Sway Model 
turned in trustworthy results; the seasonal template bisected the prospects so well that the 


fraction of positive errors precisely matched the expected value of 0.5. 
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6.2. Roundup of Directional Bias 


As we have seen, the Idle Model presumes equals odds of the market moving to the upside or 
downside. However, a two-sided binomial test quashed the presumption to a high degree of 
certitude. By this result alone, we may discard the EMH and its offal. 

Meanwhile, the Drift patch-up yielded roughly the same number of overshoots as 
underruns. Even so, the template was somewhat unrealistic based on the imbalance of errors. 
The subdued form of rejection was hardly surprising. After all, the model was concocted 
precisely to circumvent a blatant defect of the Idle icon; namely, the existence of an uptrend 
over the long run coupled with a bias for upstrokes over downswings. 

Since the Drift mantra calls for a fixed forecast at each stage, a devotee of the bromide 
has no reason to buy or sell any shares of stock after acquiring an initial stake at the outset. 
This rationale gives rise to the buy-and-hold policy by which an investor procures a stake at 
the outset then retains the position indefinitely. By the same token, the Drift motif provides no 
further help to an enterprising player who hopes to outpace the market as a whole. 

In the case of the Sway Model, the number of errors to the upside versus downside 
happened to match up precisely. By this result alone, the supple model outclassed its stilted 
rivals. Put another way, the seasonal framework was fully compatible with the data rather than 
inconsistent with the facts as in the case of the Efficient mockups. 

To round up, this section has examined the fitness of the candidate models based on the 
frequency of errors to the upside versus downside regardless of the extent of the lapses. Each 
template was assessed separately to determine whether it conforms to the historical record. A 


different way to appraise the models is to check their performance against each other. 


7. Global Odds of Orientation 


The trio of pricing models may be pitted against each other in a pairwise fashion. In 
particular, one template outranks another if its tally of miscues undercuts its rival’s. According 
to the null hypothesis, the fraction of wins for any model against any other should equal one- 
half. On the other hand, we may have good reason to believe that a given model should 
outplay another; in that case, the alternative conjecture posits the fraction of wins to exceed 


one-half. 
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As an initial step, the Drift template was compared against the Idle idol. The null 
hypothesis presumed no difference in efficacy thus entailing a win ratio of 0.5. Based on the 
results from Section 6, however, the Drift makeshift was wont to outshine the Idle mockup. 
For this reason, the alternative thesis opined that the former should trounce the latter with a 
probability in excess of one-half. 

The Drift Model in fact triumphed in 478 cases out of 840 trials, thus scoring a win ratio 
of some 56.9%. Moreover the p-value of 3.538x10~> proved the advantage of the winner to be 
highly significant. 

The results of the previous section also suggest that the Sway Model should outstrip the 
Idle version. For this reason, the null hypothesis assumed no difference in accuracy while the 
alternate premise favored a fraction in excess of 0.5. A rundown of the data showed the Sway 
template to outperform the Idle motif in some 55.5% of the cases. Moreover, the p-value of 
0.0008374 marked an ample level of statistical significance. 

The next checkup involved a similar assay for the Sway framework against the Drift 
mockup. The former beat the latter in some 53.7% of the cases and yielded a p-value of 
0.01763. In this way, the cyclic template capped its trending rival at a respectable level of 
statistical clout. 

To sum up, the Drift makeshift outranked the Idle icon in terms of the odds of rising and 
falling. Better yet, the Sway template outflanked both of the Efficient send-ups with 
remarkable consistency. The advantage of the seasonal framework prevailed at ample levels 
of statistical significance throughout the timespan covered by the study. 

AS we saw in Section 6, the directional moves of the market were by themselves enough 
to purge the dogma of perfect efficiency. Moreover, this section has rendered similar results 
based on the performance of the pricing models against each other. 

From a larger stance, each type of appraisal may also be performed on smaller portions 
of the annual cycle. An example involves a marked tendency of the market to falter during the 


summer then advance in the autumn. 


8. Odds of Intra-year Waver 


In the previous two sections, the monthly moves of the stock market were canvassed across 
the entire timeline without paying heed to the errors by month over the course of the year. 


This section and the next, however, delve into a notable pattern within the annual cycle. 
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According to the Efficient mantra, the odds of an upturn versus a downcast should be uniform 
throughout the year. Yet here again, the reality takes exception to the illusion. 

We recall from Fig. 4 that the average return for October was positive while that for 
August was negative. In that case, two types of questions spring to mind. The first kind 
concerns the comparative odds of rising or falling between the two periods, while the second 
involves the relative sizes of the attendant moves. From a different angle, each type of query 
may refer to the matched pair of returns by year, or the standalone gains by month over the 
entire timeline without regard to the specific year of occurrence. 

The two kinds of properties are illustrated by the following queries. (1) Does the market 
advance in October more often than it does in August throughout the entire timeline? (2) Is 
the size of the return in October apt to surpass that of August in the same year? 

The first question above concerns the directionality of the moves regardless of the 
extent of the shifts or the year they transpire. To bring up a counterexample, suppose that the 
market in October were to score big gains in most cases. On the other hand, August might 
climb nearly as often but eke out only small upturns in the process; on the flip side, this period 
could perhaps endure a raft of beefy losses. In this scenario, the turnout for the two months is 
roughly equal in terms of the orientation of moves. This type of setup is explored in the 
current section. 

By contrast, the second type of query deals with the magnitude in conjunction with the 
direction of moves within the same year. In the latest example, October due to its ample gains 
has likely won the joust against August. The latter issue is addressed in Section 9. 

We know from Section 5 that October was a benign month on average while August 
was malign. For starters, then, a fitting task is to compare the frequency of upstrokes for the 
two periods. In this light, October racked up positive returns in 48 out of 70 years thus 
notching a win ratio of some 0.686. Meanwhile the corresponding score for August was 32 
wins out of 70 years thus yielding a hit rate of 0.457. 

As we have seen, the Efficient canon requires the odds of an upcast versus a downstroke 
to remain stable throughout the year. In particular, the chance of an uprise in October ought to 
match that of August without any tilt to the upside or downside. In that case, the null 
hypothesis posits the probability of upturns to be equal. Moreover, the alternative thesis 
asserts that any deviation is as likely to occur to the high side as the low end. The upshot is a 
two-tailed test of the proportions. 

Based on a binomial test, the difference in win ratios for the two periods bore a p-value 


of 0.0001584. In this way, October eclipsed August at a hefty level of statistical punch in 
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terms of the independent frequency of upturns throughout the entire timeline; that is, without 
pairing up the moves by the year of occurrence. Here lay another shaft of evidence against the 


EMH along with the Random Walk bromide. 


9. Magnitude of Intra-year Swings 


In the previous section, we compared the odds of an upturn versus a downcast for a couple of 
periods during the year; namely, August versus October. In the process, we focused on the 
direction of motion—whether upward or not—while ignoring the extent of the movement as 
well as the year of occurrence. 

On the other hand, we may also compare the two periods by gauging the size of the 
moves within each year. We recall from Section 5 that October chalked up better gains on 
average compared to August. In that case, we may ask whether October trumped August of 
the same year in a convincing way. 

In addressing this issue, we take the real-world approach by comparing the actual 
readings of the returns rather than merely their absolute values. To clarify this point, we turn 
to a numeric example. Suppose that the return in August in a given year happened to be 
—3.8% while that for October the same year amounted to —2.5%. In that case, we regard the 
latter turnout to be better for the investor than the former and thus declare October to be the 
winner that year. 

On the other hand, suppose that the signs of the payoffs were reversed. That is, August 
racked up +3.8% while October turned in +2.5%. In that case, August won the bout that year. 
In line with this cameo, the germane measure of performance is the raw return rather than 
merely the extent of the offset from the null payoff espoused by the Idle mockup. Based on 
this approach, we may inquire whether the odds of October beating August in a given year are 
better than random. From a formal stance, the null hypothesis asserts that the chance of a 
triumph for October is one-half. 

As we saw earlier, October enjoys upturns more often than does August. Even so, the 
advantage could be misleading in a way. For instance, a few huge losses to the downside 
might overwhelm a raft of sizable gains to the upside during the autumn. In this scenario, 


October rises more often but tends to suffer bigger drops when it falls. 
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In actuality, the vantage of October showed up in the median level as well as the mean 
value of the monthly returns. The dual edge suggests that the buoyant month may in fact have 
prevailed in a material way. 

With these clues in mind, we surmise that October was apt to trump August of the same 
year in terms of the size of the returns. As a result, the alternative hypothesis posits the 
fraction of wins to surpass one-half thereby requiring a one-tailed test to the upside. The win 
ratio for October came out to 45 out of 70 years, thus yielding a quotient of some 0.643. The 
distinction between the latter fraction and the default value of 0.5 displayed a p-value of 
0.01123. In this way, October prevailed against August in a credible fashion. 

To round up, an appreciable gap in performance showed up between certain months of 
the year. In particular, October was prone to outmatch August of the same year at a suasive 
level of statistical clout. From a larger stance, the existence of a palpable gap between a 
couple of months was by itself enough to scrub the whitewash of efficiency at an ample level 
of assurance. 

Even so, a note of caution is in order here. As a rule, some portion of the year is likely 
to outshine some other in any given sample due to the vagaries of a dicey process even in the 
absence of a weighty difference between the two periods. This and other niceties of the 


experimental design are examined further in the next section. 


10. Discussion 


The current study has fathomed one of the large-scale patterns in the marketplace; namely, the 
seasonal waves of the stock market. For a straightforward model of this kind, the formal 
argument requires nothing by way of prior experience with the subject matter nor advanced 
training in the techniques employed. 

The foremost point concerns the scope of knowledge—or dearth of such—needed by 
the reader. For instance, a student who has worked through any of the serious tutorials on the 
R system available on the Internet should easily grasp the program crafted for the case study. 
In acquiring the requisite skills, an example involves a popular video—suggested by a search 
engine—which is offered for free to all comers. A self-starter who has spent a few hours of 
hands-on practice with Base R, the kernel of the software platform, should have more than 


enough background to understand the code and follow the narrative. 


38 


On the other hand, the requirement on the statistical front is slightly more taxing. Yet, 
the background needed here is modest as well. More precisely, the material should be 
accessible to any student who has taken a first course in statistics whether they happen to 
major in business or psychology, biology or sociology—not to mention any of the technical 
fields such as physics or engineering. 

In short, nothing is needed by way of advanced knowledge in any subject ranging from 
finance and economics to programming and statistics. Rather, the main requirement is the 
patience to study the markets with care and descry the patterns that emerge. Granted, the 
signals in the financial realm are shifty and obscure. The primary reason for their dodgy 
nature: the telltales tend to vanish as an inadvertent but inevitable byproduct of the 
transactions as the participants take advantage of the conspicuous patterns. 

Despite the damping of signals, though, certain features persist in muted form due to the 
vagaries of the real and financial markets along with the foibles of human nature. Examples of 
the latter are legion in the world at large. For instance, every sentient person in the modern 
world knows that they should not eat to excess nor drive when drunk. Yet millions of souls 
brush aside the maxims, succumb to their whims, and pay dearly as a result. The same is true 
of the actors on the financial stage. The squelch of reason and the flare of passion show up, 
for instance, in the heat of a bubble or the freeze of a panic. 

Despite the roil of chance and noise in the world at large, however, the binomial test 
makes scant assumptions regarding the random process in play. The forthright technique is 
elemental and rugged since it requires nothing by way of iffy assumptions nor convoluted 


arguments in comparison to other forms of hypothesis testing. 


10.1. Approximate Technics in Practice 


In many a practical context, an approximate method is chosen over an exact technique. A 
common example involves the uptake of a t-test rather than a binomial probe in comparing the 
mean values of a couple of groups. 

Suppose that the size n of a given sample exceeds 25 or so while the probability p of 
success hovers around 0.5. Under these conditions, the outline of the binomial distribution 
does not differ greatly from the contour of a Gaussian ogive. For this reason, a popular 


expedient is to adopt a dubious but pliable substitute in the form of a t-test or a z-test. 
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The usual form of the t-test employs the Student-t distribution to profile the mean value 
of a set of variables, each of which conforms to a Gaussian ogive even though the underlying 
variance happens to be unknown to the investigator. To add to the muddle, however, the 
precise shape of the t-distribution depends on the number of degrees of freedom which in turn 
hinges on the size of the sample. 

On the other hand, the z-test employs the Gaussian distribution directly and applies to 
the case where the variance of the population is known beforehand. In most settings, however, 
knowledge of the variance is merely a supposition rather than an actuality. Yet the z-test is 
often employed in practice whenever the sample happens to be largish, meaning np(1 — p) > 6, 
even though the t-test would in fact be a better choice from a logical stance. 

Despite the usual expedient of employing rough and/or inapt methods, our case study 
has favored the exact binomial test. This tack was taken even though the dataset featured a 
sizable number of records which would serve as a splendid excuse to use rough-and-ready 
methods instead. 

The reasons for adopting the clean-cut technique were several. The first driver lay in the 
simplicity of the binomial test from a conceptual stance as well as a pragmatic slant. The 
second factor concerned the robustness of the procedure thanks to its abstention from shaky 
assumptions. The third facet involved the accuracy of the binal tool due to its use of sharp-cut 
numbers in vetting the null hypothesis. The fourth motive stemmed from the liberal scope of 
the methodology: in accommodating any reasonable form of random process, the duplex 
framework admits a plethora of prospective models thus stacking the odds in favor of the 
default hypothesis to the greatest possible extent. The fifth impetus sprang from the power to 
shatter the Efficient canon despite the latter’s blare that no technique, however advanced or 
sophisticated, can perform such a feat. 

All these properties combined to yield a final spur bearing on the potency of the results. 
When the binomial test in spite of its inclusive nature happens to reject a given premise, the 


verdict is highly noteworthy and convincing. 


10.2. Universal Scope of the Duplex Framework 


In the larger scheme of things, a dicey system must have the wherewithal to assume two or 


more distinct states. From the converse stance, an entity that displays only a single condition 
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happens to be a fixed and deterministic system rather than a loose and probabilistic one. For 
this reason, a pair of states comprises the minimal setup for a random process. 

In such a setting, the binomial routine represents the simplest mode of hypothesis 
testing for the simplest form of random system. Moreover, the technique imposes a minimal 
set of shackles on the random process in play. In particular, the constraints require only the 
independence of trials marked by a fixed ratio of odds for a pair of mutually exclusive 
outcomes. 

The stricture of independence represents an adjunct form of simplicity whereby no 
turnout has an impact on any of the others. On the flip side, if the outcomes were in fact 
interlinked then the prospective model as well as the statistical procedure would have to 
contend with the additional mode of complexity. 

Due to the dearth of assumptions, the duplex framework can accommodate any 
plausible form of random process. In this way, the binomial test is potent in terms of its broad 
scope of relevance. At the same time, though, the technique is feeble in the sense that it is 
unable to detect a plethora of aberrant cases. 

To dig deeper into this topic, we note that the outcomes for a random system of any kind 
may be divided into a pair of clusters, otherwise known as events. Suppose that the two events 
happen to be disjoint. In that case, the stage is set for a two-tone test. By this means, a binary 
schema may be superimposed on any random process by partitioning the entire set of 
outcomes into a pair of disjoint events that abide by fixed odds of occurrence. 

In short, a random process of any kind may be viewed through the lens of a binal 
framework. Due to the liberal—and in fact, universal—approach to modeling chancy systems, 
the binomial test is unable to pinpoint many a deviant case. On the other hand, this very 
weakness constitutes a special strength on the rare occasions when pointed results do arise. If 
the binary probe in spite of its liberal nature happens to oust the null hypothesis in a given 
application, then the uncommon judgment is apt to be sturdy and trusty rather than fragile and 


suspect. 


10.3. Frequency of Errors 


As we have seen, a variety of assays prove the Idle template to deny the reality of the 
marketplace. An exemplar concerns the asymmetry of the modeling errors in the upward 


versus downward headings. In the foregoing sections, we examined the monthly returns and 
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checked whether the data are compatible with the EMH and its spinouts. We found the actual 
behavior of the stock market to contravene the touted image of efficiency. 

According to the EMH and the Random Walk, the chance of an uprise matches that of a 
downstroke. As a corollary, the null hypothesis presumes the fraction of upturns to equal one- 
half. Unfortunately, a two-sided test using the binomial probe proved the premise to be false 
at a high level of significance. By this result alone, we may uproot the Idle idol and its 
outspreads. 

Meanwhile, the Drift Model yielded roughly the same number of overshoots as 
underruns after adjusting for the secular uptrend. Even so, the template was typecast as 
somewhat unrealistic due to an imbalance of the residuals. The overthrow of the Drift 
framework by a small margin was hardly surprising. After all, the model was concocted 
precisely to rectify a patent defect of the Idle mockup; namely, the existence of an uptrend in 
price over the long haul along with the propensity of the market to rise more than fall. 

The Drift motif entails a rigid forecast at each stage while taking into account only the 
secular trend. Based on this parody of the bourse, an investor has no reason to buy or sell any 
shares at all after taking up an initial stake at the outset. A direct fallout is the buy-and-hold 
ploy by which the doctrinaire acquires a stake then retains the position indefinitely. As a 
corollary, the Drift Model offers no practical help to an aspiring investor who hopes to 
outpace the bourse as a whole by timing the market. 

On the other hand, the ratio of miscues to the upside versus downside for the Sway 
Model was precisely the expected value. In this way, the flexile template bested its rigid 
rivals. By the same token, the limber framework was fully compatible with the data rather 


than inconsistent with the facts as in the case of the Efficient motifs. 


10.4. Constructive Methods Versus Expunging Schemes 


On one hand, a demonstration that the market is not purely random represents a solid and 
worthwhile result. Even so, the disproof by itself says little or nothing about how the market 
actually does behave. Herein lies the advantage of a constructive method such as the Sway 
Model that portrays the market at a hefty level of statistical punch. In particular, this instance 
of domain theory demonstrates that the bourse moves in tune with the seasons of the year in a 


subtle but persistent fashion despite the ceaseless turmoil along the way. 
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By contrast, the errors due to the Idle Model displayed a marked deviation from the 
premise of equal odds to the upside versus downside. Moreover, a binomial test proved the 
warped pattern to be substantive. In this way, the Idle Model failed to capture a vital feature of 
the target system; namely, the undulation of the market throughout the year. 

In comparison, the Drift framework yielded similar results but to a lesser degree. More 
precisely, the pattern of errors was sufficiently unbalanced to classify the trending motif as a 
false model of the market. 

Better yet, the Sway Model did an admirable job of unveiling the seasonal patterns. 
Moreover, the lithe template displayed no imbalance in portraying the moves to the upside 
versus downside. In this way, the cyclic motif captured a vibrant feature of the stock market. 
We could say that the Sway Model is simple but not simplistic. 

In brief, a primal goal of the case study was to refute the Efficient mythos by profiling 
the hops and dips of the stock market. From a larger stance, a purging method of this kind 
shows how the market does not behave by exposing an inconsistency between the putative 
model and the referent system. Even better, however, is a constructive tactic that reveals how 
the market does behave. A showpiece lies in the Sway template that captures the seasonal 
waves of the stock market. 

More generally, a cogent model coupled with a concise program can yield solid results 
from a theoretic stance as well as a pragmatic slant. For instance, the current study has cleared 
up a murky issue that has long stymied the mass of participants in the field ranging from wild- 
eyed rookies to jaded oldsters. Given the universal nature of the binomial test, the 
methodology presented here may be trained on knotty issues in diverse fields. The expansive 
approach befits all manner of systems rife with complexity ranging from mazy structures and 


flighty events to noisy data and hazy signals. 


10.5. Caveat for Intra-year Results 


The current study has fathomed the behavior of the stock market over the course of several 
generations. The inquest dealt with faint but persistent patterns attuned to the seasons of the 
year. Since the survey was far-ranging and evenhanded, we may infer that the inquest was 
impartial and the results well-founded. 

To be sure, the conclusions from this or any other workout could and should depend on 


the dataset employed. As a counterpoint, a discrepant result may arise in plumbing a subset of 
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the very same dataset employed in this study; or perhaps a buildup of observations to be made 
in the future. 

From a larger stance, the dependence of the conclusions on a given sample is a proviso 
that applies to any assay in any domain. Moreover, the potential for wayward results is 
precisely the reason for employing a battery of statistical tests rather than leaning on a single 
probe. 

To expand on this theme, we note that Sections 8 and 9 dealt with a subset of the annual 
cycle. Due to the focus on specific portions of the dataset, the analysis ran a greater risk of 
accidental bias due to the very nature of a dicey process. 

On one hand, the detailed screening of a prominent portion of a dataset is a routine and 
worthwhile practice in any domain. On the other hand, the narrow scope may easily lead to a 
spate of “data snooping” whereby the apparent features along with the conclusions happen to 
be accidental, opportunistic and contingent rather than intrinsic, persistent and decisive. 

When we compare the months of the year in a pairwise fashion, one period is apt to 
diverge from the average level in the positive direction while another does the opposite. For 
this reason, the chance of a glaring rift between a couple of groups may far exceed the 
prospect of a striking feature that prevails across the dataset as a whole. Due to the potential 
for spurious patterns, the results of the previous two sections deserve a measure of skepticism. 
We would in general expect some portion of the year to perform better than average and some 
other spell to fare worse, especially when the size of each cluster is small or moderate. 

For this reason, any conclusions obtained through such an assay should be treated with a 
healthy dose of dubiety. In keeping with this adage, the intra-year patterns in spite of their 
prominence and piquancy are excluded from the roundup of results to be presented in Section 


11. 


10.6. Progression of Patterns 


This report has spotlighted a train of seasonal waves in the stock market over the year. If we 
look at the big picture, patterns of all kinds are part and parcel of the real economy as well as 
the financial forum. That is not to say, of course, that any given feature is ordained to last 
forever. On the contrary, we would expect all manner of traits to mutate as the decades go by, 


and transform even more across the centuries. 
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A specimen lies in the shrinking role of the agrarian sector in the modern culture, and so 
too for the industrial branch. The comedown means, for instance, that the harvest on the farm 
—along with the cycle of sowing and reaping—has a waning impact on the tangible economy 
as well as the financial bazaar. The story is similar for the downtime of the auto industry 
during the summer when the bulk of carmakers retool their factories in August in preparation 
for the new model year. 

By the same token, patterns of all kinds abound in the service sector. A cyclic theme 
superimposed upon a long-lived trend inheres in the swelling role of leisure travel in the 
summer and winter as droves of vacationers flock to hotspots ranging from seaside haunts and 
health spas to mountain retreats and ski resorts. 

Meanwhile, a virtual example of a secular trend appears in the groundswell of software 
agents that drive the real and financial markets. For instance, a growing army of cyber drones 
trades virtual widgets ranging from stocks and bonds to futures and options. Amid the 
ferment, any tremors in price to the upside or downside are amplified as swarms of robots 
dash in and out of the trading pits within fractions of a second. On the flip side, though, the 
spasmodic moves of the plungers at the dawn of the millennium will doubtless be tempered to 
some degree by novel strains of smart agents that weigh the prospects over the long range and 
thereby counter the jerky moves of the mindless mob. 

To sum up, the real economy as well as the financial forum find themselves in a 
continual state of evolution and renewal wracked by occasional bouts of convulsion and 
upheaval. For this reason, the patterns in the marketplace are also fated to shift and morph 


with the passage of time. 


10.7. Supplementary Materials 


The statistical assays reported here were encoded in the R language. The resulting program 
has been uploaded to GitHub, an open repository of virtual resources on the Internet. The R 
script as well as the raw data are available at the following locale: “github.com/mintkitcom”. 
The program resides in the Code folder as a document in txt format under the title of 
SpxSeasonMonthPeg. Meanwhile the dataset abides in the Data node as a file in csv 


form under the rubric of SoxMonth1950-2020. 
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10.8. Future Directions 


As we have seen, the binomial test has scant cause in general to dislodge the null hypothesis 
due to the parsimony of a priori assumptions. In this sense, the bipartite scheme has low 
power: the probability of rejecting the default premise when the alternative happens to be true. 
Given the strong bias in favor of restraint and caution, along with the paucity of notable 
conclusions, the technique is widely shunned by investigators engaged in scientific research 
as well as pragmatic inquest. 

Despite its demure nature in general, though, the binomial routine has in the current 
context revealed conspicuous patterns in the penchant of the stock market to rise rather than 
fall. From a different slant, the trio of pricing models displayed marked differences in terms of 
the frequency as well as the magnitude of the errors they entail. 

In addition to these examples, the dogma of Efficiency beggars belief in manifold ways. 
A fundamental flaw lies in the lack of path dependence; namely, the notion that the prospect 
of rising or falling is independent of the prior history and current state of the market. 

A counterexample in the literature involved a survey of daily prices for the SPX wherein 
the dataset was extended in the backward direction to the beginning of 1928. In the forward 
heading, the time series ran till the end of August 1991. On the bright side, the investigation 
disproved the Random Walk canard by showing that the daily returns did not conform to the 
Gaussian ogive as required by the Efficient doctrine. 

On the glum side, though, a hasty trawl of the data may go overboard and dredge up far- 
fetched results. A showpiece arose in the foregoing study wherein the daily returns were found 
to be autocorrelated across a vast range of time scales. In particular, the observations 
displayed positive correlations for lags in excess of 2500 days; that is, stretches of trading 
days spanning a decade or more (Ding et al., 1993). In that case, for instance, the rise or fall 
of the SPX on a given day was apt to be followed by a move in the same direction 10 years 
afterward. 

Admittedly, the market does exhibit a dollop of serial correlation over the short run. As 
an example, a flash of good news may prod the SPX into spurting higher in stages over the 
span of several hours as eager investors rush in to buy shares with gusto. A showpiece in the 


converse direction crops up in the throes of an overreaction; for instance, a big fall on a given 
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day often draws contrary punters into the market the next day and thus spurs a rebound to 
some extent. 

On the other hand, there is no credible scenario by which the mass of investors would 
recall how they acted on a particular day 10 years earlier; and even if they did, there is no 
good reason to suppose that they would as a result act in the same way today. Instead, the 
most plausible explanation for the positive correlation stems from the fact that the SPX tends 
to rise much oftener than to fall. As a byproduct, each upstroke is usually followed by similar 
moves over all time frames ranging from the near term to the long haul. 

In such a lopsided environment, a high-strung probe may well reckon that a linkage 
exists across prolonged stretches even in the absence of an actual tie-up. In this and other 
ways, spurious results often arise when the presumed conditions behind the assays are tightly 
constrained. 

An exemplar involves a checkup grounded on the faulty premise of a normal 
distribution behind the data. False positives are bound to arise with troublesome frequency 
from the rampant custom of mapping confidence intervals based on the z-statistic or the t- 
statistic, each of which stands on the assumption of normality. In an errant practice that cuts 
across diverse disciplines, one portion of an assay may affirm that the dataset belies the 
normal distribution; yet the selfsame study dishes out ancillary conclusions based on the 
Gaussian ideal. In that case, the results are inconsistent from a logical stance and invalid from 
a pragmatic slant. 

By contrast, the binomial routine happens to be nonparametric and robust. Moreover, 
the procedure is highly inclined to err on the side of the null hypothesis due to the minim of 
assumptions. Most important of all, the methodology does not succumb to the vice of internal 
inconsistency revolving around the underlying distribution. Thanks to these virtues, the 
duplex framework may be employed with confidence in disparate fields where the patterns are 
so prominent that they distinguish themselves despite the detached mien of the binomial test. 

From a larger stance, a realistic model in any domain ought to reflect the crucial 
features including the nonlinear aspects of the target system ranging from abrupt causes to 
drawn-out effects. An example of the former is a disruption of oil supplies that prompts a 
chain reaction of blowouts throughout the economy. Meanwhile an instance of the latter is a 
plunge in the rate of unemployment which prods the stock market into an upsurge followed by 
a downswing thence to a cadence of dwindling waves. 

In line with these examples, the real and financial markets offer numerous opportunities 


for building concise, faithful and rugged models. To this end, the studies may employ a 
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wholesome blend of versatile tools ranging from the concepts of nonlinear dynamics to the 


techniques of nonparametric statistics. 


11. Conclusion 


Binary models can portray complex systems with the utmost of simplicity, clarity and efficacy 
in areas ranging from the dearth of prior premises to the staunchness of final conclusions. The 
vantage of the binomial scheme shows up, for instance, in expunging the welter of myths and 

misconceptions in the marketplace. 

As a backdrop, the fields of finance and economics play central roles in the allocation of 
limited resources such as raw materials and capital equipment, office space and human labor. 
Unfortunately, the ruling doctrine stands on a mire of myths and mirages, contradictions and 
non sequiturs. As a recourse, a cogent alternative lies in a trusty approach based on solid facts 
in lieu of hollow myths. For this purpose, a showcase lies in the seasonal patterns of the stock 
market by way of monthly returns throughout the year. 

According to the core credo of financial economics, the markets are so nimble as to 
absorb every nub of information with startling speed and react at once with perfect rationality. 
This sweeping doctrine, dubbed the Efficient Market Hypothesis, has come to dominate 
thought and action in areas ranging from asset valuation and financial forecasting to 
investment strategy and portfolio management. A direct outgrowth of the EMH is the Random 
Walk Model that pictures the market as an erratic system immune to prediction except perhaps 
as an occasional fluke. Another offshoot lies in the mantra of buying and holding a dicey asset 
forever as an unbeatable strategy for investment. 

In these and other ways, the financial forum is supposedly an ideal system whose 
performance in the aggregate cannot be surpassed by any of the participants regardless of 
their background, intelligence or diligence. The blanket premise and its spinouts cover the 
totality of elemental widgets such as common stocks as well as compound objects as in 
market benchmarks. 

Sadly, though, the charade of efficiency flouts the facts of life in plenteous ways. To 
underscore the chasm between the illusion and reality, the current study has employed only a 
minute fraction of the wealth of resources available to the general public. 

Firstly, the raw data comprised a mere drop in the sea of information freely available at 


the favorite portal of the investing public. Secondly, the candid portrait of seasonal patterns in 
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the marketplace is intuitive and comprehensible to any practitioner in the trenches or observer 
on the sidelines. Thirdly, the conceptual model was converted into a sparing subset of the R 
language, the most popular medium for data science in the context of academic research as 
well as pragmatic inquest. Fourthly, the computational analysis was performed via free and 
open-source software in the form of the R platform. Fifthly, the case study dealt with the stock 
market: the paragon of a flighty and unruly system that foils the mass of actors—ranging from 
part-time amateurs to full-time professionals—who strive to beat the passive benchmarks such 
as the S&P Index. Sixthly, the quantitative assay drew upon the simplest tool in hypothesis 
testing; namely, a statistical probe that is readily accessible to students at the undergraduate 
level regardless of their fields of specialization. 

From a formal stance, the canard of efficiency translates into the Idle and Drift Models 
of price action. In contrast, an honest portrait of the bourse lies in the Sway framework that 
captures the seasonal cycle throughout the year. The trio of templates was encoded in the R 
language then assessed against the monthly returns of the S&P benchmark. Despite its 
leanness, the Sway Model of intra-year waves outclassed the Idle and Drift mockups in divers 
ways. 

From a practical stance, the Efficient doctrine asserts that any bent of the market to the 
upside should match that to the downside. The binal schema sets the stage for a checkup of 
each model based on the frequency of moves in the upward versus downward headings. From 
this perspective, the main conclusions were as follows. 

According to the EMH along with the Random Walk Model, the chance of an uprise or 
a downturn should each equal 0.5. A binomial probe, however, voided the null hypothesis 
against a two-tailed test of the alternative premise at a highly significant level. The actual 
fraction of turnups was around 60.2%. Moreover, the p-value of 3.216x10 ° was far more 
acute and rejective than the usual threshold of 0.05 for voiding the null hypothesis. 

The Drift Model represents a patch-up for a blatant defect of the Idle icon, as 
highlighted by the foregoing result. The workaround takes account of the uptrend of the 
bourse over the long range by pegging the geometric average of the returns on a monthly 
basis. Based on this mockup, the proportion of upturns beyond the uptrend amounted to 
nearly 53.6%. A two-tailed test against the null hypothesis of balanced moves to the upside 
versus downside returned a p-value of 0.04172. That is, the binomial test managed by a slim 
margin to refute the Drift canard of equal odds of rising or falling after adjusting for the 


trendline. 
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By contrast, the Sway Model performed much better in profiling the odds of rising and 
falling. In fact, the number of upturns was 420 out of 840 cases thus precisely matching the 
expected value of 0.5. In this way, the viable scheme conformed to the null hypothesis of 
equal odds against the alternative premise marked by a two-sided test. Given the p-value of 
1.0, the limber template conformed readily to the reality of the stock market. Unlike the 
Efficient templates, the Sway motif did not come close to contradicting the data. 

To recap, the Idle Model fell grossly afoul of the reality due to the warpage of moves to 
the upside versus downside. Moreover, the Drift template failed by a small margin to abide by 
the notion of balanced movements around the trendline. Better yet, the Sway Model eclipsed 
both of the Efficient templates by bisecting the prospects of rising and falling with a great 
deal of verism. In these ways, a binal checkup of the price action was by itself enough to raze 
the cult of efficiency. For further evidence, however, the magnitude of the modeling errors 
was also scrutinized. 

The last few paragraphs focused on the market in terms of the direction of motion. Each 
assay was inherently a binate affair involving a move to the upside or downside without 
paying heed to the extent of the traversal. An alternate mode of appraisal centered on the size 
of the errors due to each model. More precisely, one template outranked another if the 
amplitude of the residuals on average happened to be less than its rival’s. The key results were 
as follows. 

Based on the first set of findings as described above, there was good reason to believe 
that the Drift Model would outshine the Idle mockup. For this reason, the null hypothesis of 
equal performance was matched against the alternative premise of greater accuracy by the 


Drift motif. The trending template trounced the idling motif in some 56.9% of the cases. 


Moreover, a one-tailed test returned a p-value of 3.538x10 >. In this way, the null hypothesis 
of equal performance was deposed in favor of the Drift template at a high level of 
significance. 

As to be expected, the Sway Model also trumped the Idle mockup. The fraction of wins 
amounted to 55.5% while the p-value of 0.0008374 marked a stout level of statistical import. 

The Sway template outmatched the Drift motif as well. The supple model triumphed in 
53.7% of the cases. As a result, the null hypothesis of equal performance was annulled and the 
vantage of the Sway template confirmed at a p-value of 0.01763. 

In these and other ways, the seasonal model of the stock market bested the Efficient 
motifs through multiple means at ample levels of statistical heft. The upshot was to roundly 


upend the ruling dogma of finance and economics. 
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To sum up, a trickle of incisive studies over the decades has revealed a variety of flaws 
in the Efficient Market Hypothesis and its byproducts. Due to the patchwork of precedents, 
exposing the jumble of myths and misconceptions behind the specious doctrine is not in itself 
a novel exercise. Rather, the progressive aspects of this report lie in the elemental nature of 
the arguments along multiple dimensions. The drawcards of the study include the simplicity 
of the seasonal model at a conceptual level, the ease of acquiring the information required, the 
parsimony of the dataset employed, the ubiquity of the software deployed, the universality of 
the experimental setup, and the strength of the conclusions at high levels of statistical 
significance. 

From a larger stance, the duplex framework showcased here represents the simplest way 
to model flighty systems regardless of their complexity. The crux of the methodology lies in 
the binomial test: the most basic tool in statistics for checking presumptions and drawing 
conclusions. The binal schema renders a medley of benefits ranging from the clarity of 


concepts and ease of usage to the soundness of inference and sureness of conclusions. 
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